On using voice source measures in automatic gender classification of children's speech
نویسندگان
چکیده
Acoustic characteristics of speech signals differ with gender due to physiological differences of the glottis and the vocal tract. Previous research [1] showed that adding the voice-source related measuresH∗ 1 −H∗ 2 andH∗ 1 −A3 improved gender classification accuracy compared to using only the fundamental frequency (F0) and formant frequencies. H∗ i refers to the i–th source spectral harmonic magnitude, and Ai refers to the magnitude of the source spectrum at the i–th formant. In this paper, three other voice source related measures: CPP, HNR and H∗ 2 −H∗ 4 are used in gender classification of children’s voices. CPP refers to the Cepstral Peak Prominence [2], HNR refers to the harmonic-to-noise ratio [3], andH∗ 2 −H∗ 4 refers to the difference between the 2nd and the 4th source spectral harmonic magnitudes. Results show that using these three features improves gender classification accuracy compared with [1].
منابع مشابه
Voice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملA Comparative Study of Gender and Age Classification in Speech Signals
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...
متن کاملIntelligent Voice Recognition System Based on Acoustic and Speaking Fundamental Frequency Characteristics
Speech recognition is a fascinating application of Digital Signal Processing and has many real-world applications. In this paper, a speech recognition system is developed for isolated spoken words using Discrete Wavelet Transforms (DWT) and Artificial Neural Networks (ANN). Speech signals are one-dimensional and are random in nature. This paper investigates Automatic Speech Recognition of gende...
متن کاملAdult Voice Recognition System using Text Variable Phoneme Model and Coarse Speaking Fundamental Frequency Characteristics
-------------------------------------------------------Abstract--------------------------------------------------------Speech recognition is a fascinating application of Digital Signal Processing and has many real-world applications. In this paper, a speech recognition system is developed for isolated spoken words using Discrete Wavelet Transforms (DWT) and Artificial Neural Networks (ANN). Spe...
متن کاملChildren's voice and voice disorders.
This article discusses the differences between children's voices and adult voices. We give an overview of the anatomy in the head and neck and specifically the anatomy of the respiratory system and the larynx. We also describe the development of children's voices including different physiological measures and voice quality. The development and consequences for voice production and voice quality...
متن کامل